<110> BENOIT, Patrick 

SCHWARTZ, Bertrand 
BRANELLEC, Didier 
CHIEN, Kenneth R. 

<120> SEQUENCES UPSTREAM OF THE CARP GENE, VECTORS CONTAINING THEM AND USES THEREOF 

<130> 03806.0530-00000 

<140> 

<141> 2001-12-07 
<160> 5 

<170> PatentIn Ver. 2.1 

<210> 1 
<211> 2358 
<212> DNA 
<213> Mus musculus 

<400> 1 

ggatcctttc atgtttaaca atatcaaccc taacccaagg ggaacagcct gcctgacagt 60 
ggctttgcca cccatgaata cttcctagtc tagtccgttt gtgaaactca gcccatccca 120 
acacttctgc aagccccatc ctctacaagg tgctcattgg gaatttcctg gagcttctct 180 
ttcaggatca gcctgattct agggcagcag ttctcaacct gggggcctcg acccctttgg 240 
gggaatcaaa cgacccttta caggggtcac atatcatcta tcctatatgt caggtattta 300 
cattacgatt cgtaacagta gcaaaattac aggtatgaaa tagcaatgaa ataattttat 360 
gattgaaggt caccacaaca tgaggccgcc acactgttct agagaaaaat cacctgggtg 420 
gggaaaggtt tgggaaagcc tttctgtcca ttcttcattc ttcaaagtga tgtgttcaca 480 
gaaagccttt cagctgttct gctggggctc ttagtaagtc tgagtaggaa ctgtatgtac 540 
caggtctgct tcttatgggt ggagccaaga cgcatcgtgg gtggagcgaa gacgcaacct 600 
caccttctag ctctgcatcc atagcaagta gcctaatgtt tctgtgtcta ggtgtcatct 660 
ctgtgaatcg agatccttgg ccttgcttga attagggagg cacaaaatac tcagagattc 720 
aagactgctc agcagcccag agtccttcct caaaggaaag gtctcaactc tcagcccccc 780 
ttagctctga gtcaggcctg gaacaaacgg ccacaggaat gagaaaagct gccatagctg 840 
cttgtcactt caagaggtca aagaaaatag tgttaaccat gaaaacgaga agaccaacag 900 
ttatccattg atagcgtctc aggacagata ggacagagag aacactagga gaggggaacc 960 
cacgaaggac aaggtattag tgtgttggtt ttcagggcaa tgtcttgtac tgaagattct 1020 
agaaacacaa tttgctggtt gaacagctga agtggggtgg gggttcttac cccatgttca 1080 
tggaagggtg agtgaggaga gacagatata tgatggccag cataacaaac atacacaaca 1140 
ccctaattaa cacttccctc ttctactgac acccccttca ctctcctctt tcataaaaaa 1200 
taaaaaaagt attttatgtg gctcttacga tagaatcttt cctcgaacta taaaaagatc 1260 
taaatattta tatttttcac attttaatat cttagcgatg acaagccaga aacaagtatt 1320 
ttttgcctct ctcaacagca aagcttgggg cctttttgtt tccgtgttag gaatagaaca 1380 
cgagagcccc gtgtatctag gcagatgctc tatcattagc ccatgagtct ccagcctcag 1440 
acgcacattt ttctcgggct ctcttaagct tttcccacag cattgggaaa ctttactgac 1500 
agcatccaag ttgtgcttct gctaagaact ggactcacat ctctctgtgc atcacttcgg 1560 
cccgttttgg ggtagatcct ctgattagcc ttcagattta gaacacggtg agcctgtggt 1620 
gcactaatta tggccagtga caccatagag tcaaagtgca ttactgaatg ctttcaattt 1680 
ctcctaatgc tggtacgatg gcatgtcaca gggccatttt agctgcagac atcactccag 1740 
agaattccaa acagatagag acaagtggca cccagaccca tctccttccc ctcgggctga 1800 
ttatccccag aaataggatg tcccaaagca acacttccca gccaactgga gtgctgataa 1860 
gtccagttat cagaaagata tggctgtaag tgtgatgcac agtgcttgca ttttcttgat 1920 
acgttagtca tatgagagct gacaaagaag gaaaaagagc agcgatgtgg tgcaatatta 1980 
acaggcagct gtcccctggc ttcccgatac gtgggatgac tcgcattgct gagcggtgtg 2040 
gtcactgcca aaggaatgac cctctcacat ttcttcctga ttcgcatacg ccgcggccag 2100 
cttgtcatct ccctcttggg cttcccagac actaagtctg gaatgaaaat tcacctgcct 2160 
ctgaattggc cactggtggg ggcaggggtg tgacttggct tcccaggctg gaagattatc 2220 
tcacccagcc ctagctatat aacgggctgg tgtggagggg ctccacaggg ccagttccag 2280 
gggttcatcc acaagagaga aaaacataga ctcgaggtct agggagcttg catgcctgca 2340 
ggtcggaggc caccatgg 2358 



<210> 2 
<211> 2074 
<212> DNA 

<213> Homo sapiens 
<400> 2 

ctgcagcaag ttacttaatg ttttttgcct cagcatcctc tctgtaaaat gagagcatta 60 
gtcttgctcc aacttcgagg gcatggacag ctctgggatt tcatatccaa gacccttaaa 120 
catcccacag tccttccccc aaacacttct cctcctaata cctccctcag tttgggtcag 180 
gcctggaaca aaaaggcata cgaaatggta gaaaaagtgt ccatgactac ttctgactta 240 
gatgaagaga ccaatgaaaa tagtaatgac tctgtttgct tcagcaggac atatactaaa 300 
ataggagcta tacaaagaag attagcatgg actctgtgca agaatgacac acaaatttgt 360 
gaaacattcc atatattaaa aataaataaa taataaagag aaaaggaaaa aattaaaaag 420 
aaaatagtga tagctgtgtc catctcaaag aaaagcccag gagatttcct ttatttaccc 480 
cctttaagat agaatattag gagaccggaa catatgatac aggaggtact gggagggtcc 540 
ctctttgtca atgttttgtc ttggggtggg gagtcgatgt cttctcaaag tttcagaaac 600 
accatccact gactgagcat tcaaggggca agaggagaat ggcagccaca tttgttgatt 660 
gggtgagttt ggggagaaat agacacacaa aggtcaaaca taacttccta attaacactt 720 
ccctccattc acaattccct tctcccattc ttctctcctg tcttttacts akaraaaccc 780 
agtttttcct gaaactataa aaataccccc agtatgttta cataatttac acctcaaaga 840 
ttagaaacca gaaatagaga ccttttcaac ccttccggaa gcaaagtgca ttatccctcc 900 
agccacgtgt ctcaaatctt gatgcatcag aatcatctgg gtgctttkaa attcaagatg 960 
attcctacga gttaccataa atcaactcag aattccctgg agtggggcca gggatctgta 1020 
tttctgacaa gctcccacag gtgattcctt tccccacagc atttgagaac ttcagctcaa 1080 
tgacctaatc agagtcctgc cattgctaat atctggtctc atttttbtca tatatatata 1140 
tagtatttgt ggtagagatg ggattttgcc atgttgccca ggctagtatt gaactcctaa 1200 
gctaagcaat cttcctgtct ctgcctccca aaatgttggg attacaggtg taagccactg 1260 



cacccggctg atagctggtt tcatttactc tatttcttga ccactctgat ccattttgaa 1320 
gtaaaaatgc tccaattatt atgctgtttt agaacacggt aagcatgtca tgtgctaatg 1380 
gccagtgaca tcataaaaga aaagtgcatt actgaatgct ttcaatgtct tataatgatg 1440 
gtaaggtggc atgtcatggg gcctatttag cccagacatc actccaaaga attccaaaca 1500 
gatatagaca agtgccttta gggcccagat cccttcccct caggctgttt acccagggaa 1560 
taggatgtcc tgggacaagt ttcccctaag tgaagtgttg ataagtctgc ttatcagaaa 1620 
gatattactg ggggtgtgat atgtagggca tctacatttt cttgataggt agtcatatga 1680 
aagctgacaa agaaaaaaag ggcagtgatg tggtgcaatg tcaacagaca gctgtcccct 1740 
gactcttgac aaataggatg acttgcattg ctgagcgatg tgatcaccac caaaggaatg 1800 
gccctctcac atttcttcct gattcacata ttcagcaggg ttagcttgtc ctcccctccc 1860 
tcttcagctt cccagacact gagtctggaa tgaaaattca cctgcctctg agttggctcc 1920 
taatgggggc gggagtgtta cttcggttcc caggttggaa gattatctca cccggcccca 1980 
gctatataag ctgaccggtg tggaggggcc cagcagggcc aactccaggg attccttcca 2040 
cgacagaaaa acatacaaga ctccttcagc caac 2074 

<210> 3 
<211> 750 
<212> PRT 
<213> Homo sapiens 

<400> 3 

Met Gly Glu Thr Leu Gly Asp Ser Pro Ile Asp Pro Glu Ser Asp Ser 
1 5 10 15 

Phe Thr Asp Thr Leu Ser Ala Asn Ile Ser Gln Glu Met Thr Met Val 
20 25 30 

Asp Thr Glu Met Pro Phe Trp Pro Thr Asn Phe Gly Ile Ser Ser Val 
35 40 45 

Asp Leu Ser Val Met Glu Asp His Ser His Ser Phe Asp Ile Lys Pro 
50 55 60 

Phe Thr Thr Val Asp Phe Ser Ser Ile Ser Thr Pro His Tyr Glu Asp 
65 70 75 80 

Ile Pro Phe Thr Arg Thr Asp Pro Val Val Ala Asp Tyr Lys Tyr Asp 
85 90 95 

Leu Lys Leu Gln Glu Tyr Gln Ser Ala Ile Lys Val Glu Pro Ala Ser 
100 105 110 

Pro Pro Tyr Tyr Ser Glu Lys Thr Gln Leu Tyr Asn Lys Pro His Glu 
115 120 125 

Glu Pro Ser Asn Ser Leu Met Ala Ile Glu Cys Arg Val Cys Gly Asp 
130 135 140 

Lys Ala Ser Gly Phe His Tyr Gly Val His Ala Cys Glu Gly Cys Lys 
145 150 155 160 

Gly Phe Phe Arg Arg Thr Ile Arg Leu Lys Leu Ile Tyr Asp Arg Cys 
165 170 175 

Asp Leu Asn Cys Arg Ile His Lys Lys Ser Arg Asn Lys Cys Gln Tyr 
180 185 190 

Cys Arg Phe Gln Lys Cys Leu Ala Val Gly Met Ser His Asn Ala Ile 
195 200 205 

Arg Phe Gly Arg Met Pro Gln Ala Glu Lys Glu Lys Leu Leu Ala Glu 
210 215 220 

Ile Ser Ser Asp Ile Asp Gln Leu Asn Pro Glu Ser Ala Asp Leu Arg 
225 230 235 240 

Ala Leu Ala Lys His Leu Tyr Asp Ser Tyr Ile Lys Ser Phe Pro Leu 
245 250 255 

Thr Lys Ala Lys Ala Arg Ala Ile Leu Thr Gly Lys Thr Thr Asp Lys 
260 265 270 

Ser Pro Phe Val Ile Tyr Asp Met Asn Ser Leu Met Met Gly Glu Asp 
275 280 285 

Lys Ile Lys Phe Lys His Ile Thr Pro Leu Gln Glu Gln Ser Lys Glu 
290 295 300 

Val Ala Ile Arg Ile Phe Gln Gly Cys Gln Phe Arg Ser Val Glu Ala 
305 310 315 320 

Val Gln Glu Ile Thr Glu Tyr Ala Lys Ser Ile Pro Gly Phe Val Asn 
325 330 335 

Leu Asp Leu Asn Asp Gln Val Thr Leu Leu Lys Tyr Gly Val His Glu 
340 345 350 

Ile Ile Tyr Thr Met Leu Ala Ser Leu Met Asn Lys Asp Gly Val Leu 
355 360 365 

Ile Ser Glu Gly Gln Gly Phe Met Thr Arg Glu Phe Leu Lys Ser Leu 
370 375 380 

Arg Lys Pro Phe Gly Asp Phe Met Glu Pro Lys Phe Glu Phe Ala Val 
385 390 395 400 

Lys Phe Asn Ala Leu Glu Leu Asp Asp Ser Asp Leu Ala Ile Phe Ile 
405 410 415 

Ala Val Ile Ile Leu Ser Gly Asp Arg Pro Gly Leu Leu Asn Val Lys 
420 425 430 

Pro Ile Glu Asp Ile Gln Asp Asn Leu Leu Gln Ala Leu Glu Leu Gln 
435 440 445 

Leu Lys Leu Asn His Pro Glu Ser Ser Gln Leu Phe Ala Lys Leu Leu 
450 455 460 

Gln Lys Met Thr Asp Leu Arg Gln Ile Val Thr Glu His Val Gln Leu 
465 470 475 480 

Leu Gln Val Ile Lys Lys Thr Glu Thr Asp Met Ser Leu His Pro Leu 
485 490 495 

Leu Gln Glu Ile Tyr Lys Asp Leu Tyr Ala Trp Ala Ile Leu Thr Gly 
500 505 510 

Lys Thr Thr Asp Lys Ser Pro Phe Val Ile Tyr Asp Met Asn Ser Leu 
515 520 525 

Met Met Gly Glu Asp Lys Ile Lys Phe Lys His Ile Thr Pro Leu Gln 
530 535 540 

Glu Gln Ser Lys Glu Val Ala Ile Arg Ile Phe Gln Gly Cys Gln Phe 
545 550 555 560 

Arg Ser Val Glu Ala Val Gln Glu Ile Thr Glu Tyr Ala Lys Ser Ile 
565 570 575 

Pro Gly Phe Val Asn Leu Asp Leu Asn Asp Gln Val Thr Leu Leu Lys 
580 585 590 

Tyr Gly Val His Glu Ile Ile Tyr Thr Met Leu Ala Ser Leu Met Asn 
595 600 605 

Lys Asp Gly Val Leu Ile Ser Glu Gly Gln Gly Phe Met Thr Arg Glu 
610 615 620 

Phe Leu Lys Ser Leu Arg Lys Pro Phe Gly Asp Phe Met Glu Pro Lys 
625 630 635 640 

Phe Glu Phe Ala Val Lys Phe Asn Ala Leu Glu Leu Asp Asp Ser Asp 
645 650 655 

Leu Ala Ile Phe Ile Ala Val Ile Ile Leu Ser Gly Asp Arg Pro Gly 
660 665 670 

Leu Leu Asn Val Lys Pro Ile Glu Asp Ile Gln Asp Asn Leu Leu Gln 
675 680 685 

Ala Leu Glu Leu Gln Leu Lys Leu Asn His Pro Glu Ser Ser Gln Leu 
690 695 700 

Phe Ala Lys Leu Leu Gln Lys Met Thr Asp Leu Arg Gln Ile Val Thr 
705 710 715 720 

Glu His Val Gln Leu Leu Gln Val Ile Lys Lys Thr Glu Thr Asp Met 
725 730 735 

Ser Leu His Pro Leu Leu Gln Glu Ile Tyr Lys Asp Leu Tyr 
740 745 750 

<210> 4 
<211> 30 
<212> DNA 
<213> Artificial sequence 

<400> 4 
GGCGATTTAA ATAATGTAGT CTTATGCAAT 30 

<210> 5 
<211> 31 
<212> DNA 
<213> Artificial sequence 

<400> 5 
GGGGTCTAGA AGGTGCACAC CAATGTGGTG A 31 






